Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 500 |
| Missing cells | 2716 |
| Missing cells (%) | 28.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 74.3 KiB |
| Average record size in memory | 152.3 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 7 |
| Boolean | 1 |
Alerts
| Alert | Type |
|---|---|
| EFF_DT has a high cardinality: 482 distinct values | High cardinality |
| BKG_DT has a high cardinality: 485 distinct values | High cardinality |
| FNCT_CCY has a high cardinality: 144 distinct values | High cardinality |
| TXN_CCY has a high cardinality: 143 distinct values | High cardinality |
| SRC_SYS_ID has 93 (18.6%) missing values | Missing |
| FRS_BU has 133 (26.6%) missing values | Missing |
| FRS_AFFL_CD has 170 (34.0%) missing values | Missing |
| ACTG_UNIT_ID has 212 (42.4%) missing values | Missing |
| GOC has 288 (57.6%) missing values | Missing |
| MNGD_SEG has 169 (33.8%) missing values | Missing |
| BASE_CCY_AMT has 259 (51.8%) missing values | Missing |
| FNCT_CCY_AMT has 137 (27.4%) missing values | Missing |
| ENTRPS_PROD_CD has 136 (27.2%) missing values | Missing |
| TXN_CCY_AMT has 50 (10.0%) missing values | Missing |
| CITI_LV has 195 (39.0%) missing values | Missing |
| ib_flag has 149 (29.8%) missing values | Missing |
| segr_flag has 138 (27.6%) missing values | Missing |
| FRS_ACCOUNT_CLASS has 163 (32.6%) missing values | Missing |
| GAAP_TYP_CD has 132 (26.4%) missing values | Missing |
| FNCT_CCY has 144 (28.8%) missing values | Missing |
| TXN_CCY has 148 (29.6%) missing values | Missing |
| EFF_DT is uniformly distributed | Uniform |
| BKG_DT is uniformly distributed | Uniform |
| FNCT_CCY is uniformly distributed | Uniform |
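The per-variable missing counts in the alerts above can be cross-checked against the dataset-level "Missing cells" figures. The following is a minimal sketch, not part of the report itself: the dictionary is transcribed by hand from the alerts, and it assumes the percentage is the total number of missing cells divided by variables times observations.

```python
# Per-variable missing counts, transcribed from the "Missing" alerts.
missing_by_variable = {
    "SRC_SYS_ID": 93, "FRS_BU": 133, "FRS_AFFL_CD": 170, "ACTG_UNIT_ID": 212,
    "GOC": 288, "MNGD_SEG": 169, "BASE_CCY_AMT": 259, "FNCT_CCY_AMT": 137,
    "ENTRPS_PROD_CD": 136, "TXN_CCY_AMT": 50, "CITI_LV": 195, "ib_flag": 149,
    "segr_flag": 138, "FRS_ACCOUNT_CLASS": 163, "GAAP_TYP_CD": 132,
    "FNCT_CCY": 144, "TXN_CCY": 148,
}

n_variables = 19       # "Number of variables"
n_observations = 500   # "Number of observations"

# Assumption: missing share = missing cells / (variables * observations).
total_missing = sum(missing_by_variable.values())
missing_share = total_missing / (n_variables * n_observations)

print(total_missing)           # 2716
print(f"{missing_share:.1%}")  # 28.6%
```

Both results agree with the "Missing cells" rows of the dataset statistics. The two variables without a missing-value alert (EFF_DT and BKG_DT) report 0 missing values in their own sections.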
Reproduction
| Analysis started | 2023-05-04 17:09:40.551470 |
|---|---|
| Analysis finished | 2023-05-04 17:10:24.986073 |
| Duration | 44.43 seconds |
| Software version | pandas-profiling v3.5.0 |
| Download configuration | config.json |
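The reported duration follows directly from the two timestamps. A minimal sketch using only the Python standard library (the variable names are illustrative, not taken from the report):

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2023-05-04 17:09:40.551470")
finished = datetime.fromisoformat("2023-05-04 17:10:24.986073")

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")  # 44.43 seconds
```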
SRC_SYS_ID
Real number (ℝ)
| Distinct | 407 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 93 |
| Missing (%) | 18.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 88536.511 |
| Minimum | 30299 |
|---|---|
| Maximum | 149907 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 30299 |
|---|---|
| 5-th percentile | 35981.5 |
| Q1 | 57822.5 |
| median | 88834 |
| Q3 | 116933 |
| 95-th percentile | 141242 |
| Maximum | 149907 |
| Range | 119608 |
| Interquartile range (IQR) | 59110.5 |
Descriptive statistics
| Standard deviation | 34736.203 |
|---|---|
| Coefficient of variation (CV) | 0.39233761 |
| Kurtosis | -1.2035208 |
| Mean | 88536.511 |
| Median Absolute Deviation (MAD) | 29590 |
| Skewness | 0.017185501 |
| Sum | 36034360 |
| Variance | 1.2066038 × 10⁹ |
| Monotonicity | Not monotonic |
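Several of the SRC_SYS_ID statistics can be derived from one another, which gives a quick consistency check on the tables. The sketch below assumes the usual definitions (mean = sum / non-missing count, CV = standard deviation / mean, IQR = Q3 − Q1, variance = standard deviation squared); it does not reproduce how the profiling tool computes them internally, and the inputs are the rounded figures printed in the report.

```python
import math

# Figures copied from the SRC_SYS_ID tables in this report.
n_observations = 500
n_missing = 93
total = 36034360            # "Sum"
std = 34736.203             # "Standard deviation" (rounded in the report)
minimum, maximum = 30299, 149907
q1, q3 = 57822.5, 116933

n_valid = n_observations - n_missing   # non-missing values
mean = total / n_valid                 # report: 88536.511
cv = std / mean                        # report: 0.39233761
value_range = maximum - minimum        # report: 119608
iqr = q3 - q1                          # report: 59110.5
variance = std ** 2                    # report: about 1.2066038e9

print(n_valid, round(mean, 3), round(cv, 6), value_range, iqr)
print(f"{variance:.7e}")
```

The same relations can be applied to the other numeric variables by swapping in their figures. Small differences in the last digits are expected because the inputs are already rounded.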
| Value | Count | Frequency (%) |
| 89377 | 1 | 0.2% |
| 137820 | 1 | 0.2% |
| 58241 | 1 | 0.2% |
| 100403 | 1 | 0.2% |
| 133132 | 1 | 0.2% |
| 137737 | 1 | 0.2% |
| 78707 | 1 | 0.2% |
| 140196 | 1 | 0.2% |
| 96161 | 1 | 0.2% |
| 88006 | 1 | 0.2% |
| Other values (397) | 397 | 79.4% |
| (Missing) | 93 | 18.6% |
| Value | Count | Frequency (%) |
| 30299 | 1 | |
| 30470 | 1 | |
| 30882 | 1 | |
| 30981 | 1 | |
| 31219 | 1 | |
| 31237 | 1 | |
| 31523 | 1 | |
| 31584 | 1 | |
| 31783 | 1 | |
| 32104 | 1 |
| Value | Count | Frequency (%) |
| 149907 | 1 | |
| 149779 | 1 | |
| 149604 | 1 | |
| 149383 | 1 | |
| 147888 | 1 | |
| 147871 | 1 | |
| 147401 | 1 | |
| 147226 | 1 | |
| 146506 | 1 | |
| 146206 | 1 |
FRS_BU
Real number (ℝ)
| Distinct | 367 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 133 |
| Missing (%) | 26.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31018.918 |
| Minimum | 11038 |
|---|---|
| Maximum | 49979 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 11038 |
|---|---|
| 5-th percentile | 12204.9 |
| Q1 | 21045 |
| median | 30548 |
| Q3 | 40637 |
| 95-th percentile | 48244.6 |
| Maximum | 49979 |
| Range | 38941 |
| Interquartile range (IQR) | 19592 |
Descriptive statistics
| Standard deviation | 11460.234 |
|---|---|
| Coefficient of variation (CV) | 0.36945949 |
| Kurtosis | -1.2103425 |
| Mean | 31018.918 |
| Median Absolute Deviation (MAD) | 9829 |
| Skewness | -0.036353792 |
| Sum | 11383943 |
| Variance | 1.3133696 × 10⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 37123 | 1 | 0.2% |
| 35646 | 1 | 0.2% |
| 44096 | 1 | 0.2% |
| 11117 | 1 | 0.2% |
| 48190 | 1 | 0.2% |
| 11687 | 1 | 0.2% |
| 47107 | 1 | 0.2% |
| 12153 | 1 | 0.2% |
| 30627 | 1 | 0.2% |
| 44463 | 1 | 0.2% |
| Other values (357) | 357 | 71.4% |
| (Missing) | 133 | 26.6% |
| Value | Count | Frequency (%) |
| 11038 | 1 | |
| 11117 | 1 | |
| 11156 | 1 | |
| 11355 | 1 | |
| 11433 | 1 | |
| 11634 | 1 | |
| 11645 | 1 | |
| 11671 | 1 | |
| 11680 | 1 | |
| 11687 | 1 |
| Value | Count | Frequency (%) |
| 49979 | 1 | |
| 49566 | 1 | |
| 49485 | 1 | |
| 49436 | 1 | |
| 49435 | 1 | |
| 49163 | 1 | |
| 49008 | 1 | |
| 48868 | 1 | |
| 48741 | 1 | |
| 48723 | 1 |
FRS_AFFL_CD
Real number (ℝ)
| Distinct | 330 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 170 |
| Missing (%) | 34.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13014.6 |
| Minimum | 11013 |
|---|---|
| Maximum | 14989 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 11013 |
|---|---|
| 5-th percentile | 11291.95 |
| Q1 | 12041.25 |
| median | 12890.5 |
| Q3 | 14027.75 |
| 95-th percentile | 14758.55 |
| Maximum | 14989 |
| Range | 3976 |
| Interquartile range (IQR) | 1986.5 |
Descriptive statistics
| Standard deviation | 1145.9344 |
|---|---|
| Coefficient of variation (CV) | 0.088049914 |
| Kurtosis | -1.2239847 |
| Mean | 13014.6 |
| Median Absolute Deviation (MAD) | 1015.5 |
| Skewness | 0.046990188 |
| Sum | 4294818 |
| Variance | 1313165.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14356 | 1 | 0.2% |
| 11877 | 1 | 0.2% |
| 12748 | 1 | 0.2% |
| 14292 | 1 | 0.2% |
| 14502 | 1 | 0.2% |
| 14015 | 1 | 0.2% |
| 14230 | 1 | 0.2% |
| 14528 | 1 | 0.2% |
| 12193 | 1 | 0.2% |
| 13352 | 1 | 0.2% |
| Other values (320) | 320 | 64.0% |
| (Missing) | 170 | 34.0% |
| Value | Count | Frequency (%) |
| 11013 | 1 | |
| 11034 | 1 | |
| 11043 | 1 | |
| 11053 | 1 | |
| 11075 | 1 | |
| 11082 | 1 | |
| 11092 | 1 | |
| 11112 | 1 | |
| 11125 | 1 | |
| 11145 | 1 |
| Value | Count | Frequency (%) |
| 14989 | 1 | |
| 14979 | 1 | |
| 14961 | 1 | |
| 14950 | 1 | |
| 14947 | 1 | |
| 14939 | 1 | |
| 14935 | 1 | |
| 14906 | 1 | |
| 14895 | 1 | |
| 14888 | 1 |
ACTG_UNIT_ID
Real number (ℝ)
| Distinct | 288 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 212 |
| Missing (%) | 42.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2575.8507 |
| Minimum | 41 |
|---|---|
| Maximum | 4999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 41 |
|---|---|
| 5-th percentile | 242.5 |
| Q1 | 1367.25 |
| median | 2572.5 |
| Q3 | 3806 |
| 95-th percentile | 4788.5 |
| Maximum | 4999 |
| Range | 4958 |
| Interquartile range (IQR) | 2438.75 |
Descriptive statistics
| Standard deviation | 1422.4635 |
|---|---|
| Coefficient of variation (CV) | 0.55223057 |
| Kurtosis | -1.1000743 |
| Mean | 2575.8507 |
| Median Absolute Deviation (MAD) | 1229.5 |
| Skewness | -0.045873221 |
| Sum | 741845 |
| Variance | 2023402.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1971 | 1 | 0.2% |
| 3149 | 1 | 0.2% |
| 1906 | 1 | 0.2% |
| 249 | 1 | 0.2% |
| 3887 | 1 | 0.2% |
| 321 | 1 | 0.2% |
| 3504 | 1 | 0.2% |
| 187 | 1 | 0.2% |
| 1190 | 1 | 0.2% |
| 4595 | 1 | 0.2% |
| Other values (278) | 278 | 55.6% |
| (Missing) | 212 | 42.4% |
| Value | Count | Frequency (%) |
| 41 | 1 | |
| 59 | 1 | |
| 71 | 1 | |
| 81 | 1 | |
| 85 | 1 | |
| 93 | 1 | |
| 94 | 1 | |
| 97 | 1 | |
| 155 | 1 | |
| 162 | 1 |
| Value | Count | Frequency (%) |
| 4999 | 1 | |
| 4998 | 1 | |
| 4978 | 1 | |
| 4970 | 1 | |
| 4945 | 1 | |
| 4940 | 1 | |
| 4912 | 1 | |
| 4897 | 1 | |
| 4886 | 1 | |
| 4868 | 1 |
GOC
Real number (ℝ)
| Distinct | 212 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 288 |
| Missing (%) | 57.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0235358 × 10⁸ |
| Minimum | 12480153 |
|---|---|
| Maximum | 5.8692296 × 10⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 12480153 |
|---|---|
| 5-th percentile | 12481000 |
| Q1 | 12484806 |
| median | 2.3940403 × 10⁸ |
| Q3 | 2.3941158 × 10⁸ |
| 95-th percentile | 5.8692249 × 10⁸ |
| Maximum | 5.8692296 × 10⁸ |
| Range | 5.744428 × 10⁸ |
| Interquartile range (IQR) | 2.2692677 × 10⁸ |
Descriptive statistics
| Standard deviation | 2.2067353 × 10⁸ |
|---|---|
| Coefficient of variation (CV) | 1.0905343 |
| Kurtosis | -0.79144914 |
| Mean | 2.0235358 × 10⁸ |
| Median Absolute Deviation (MAD) | 2.2691833 × 10⁸ |
| Skewness | 0.80483348 |
| Sum | 4.289896 × 10¹⁰ |
| Variance | 4.8696807 × 10¹⁶ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 239408378 | 1 | 0.2% |
| 239404468 | 1 | 0.2% |
| 239411299 | 1 | 0.2% |
| 239406851 | 1 | 0.2% |
| 239413136 | 1 | 0.2% |
| 239409822 | 1 | 0.2% |
| 239404106 | 1 | 0.2% |
| 239409196 | 1 | 0.2% |
| 239408285 | 1 | 0.2% |
| 239407546 | 1 | 0.2% |
| Other values (202) | 202 | 40.4% |
| (Missing) | 288 | 57.6% |
| Value | Count | Frequency (%) |
| 12480153 | 1 | |
| 12480212 | 1 | |
| 12480296 | 1 | |
| 12480398 | 1 | |
| 12480411 | 1 | |
| 12480530 | 1 | |
| 12480555 | 1 | |
| 12480585 | 1 | |
| 12480654 | 1 | |
| 12480667 | 1 |
| Value | Count | Frequency (%) |
| 586922958 | 1 | |
| 586922913 | 1 | |
| 586922893 | 1 | |
| 586922777 | 1 | |
| 586922729 | 1 | |
| 586922643 | 1 | |
| 586922623 | 1 | |
| 586922586 | 1 | |
| 586922579 | 1 | |
| 586922559 | 1 |
MNGD_SEG
Real number (ℝ)
| Distinct | 331 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 169 |
| Missing (%) | 33.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37443.166 |
| Minimum | 3376 |
|---|---|
| Maximum | 74921 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 3376 |
|---|---|
| 5-th percentile | 5591 |
| Q1 | 18208 |
| median | 35568 |
| Q3 | 56816.5 |
| 95-th percentile | 71595 |
| Maximum | 74921 |
| Range | 71545 |
| Interquartile range (IQR) | 38608.5 |
Descriptive statistics
| Standard deviation | 21397.916 |
|---|---|
| Coefficient of variation (CV) | 0.57147722 |
| Kurtosis | -1.2602316 |
| Mean | 37443.166 |
| Median Absolute Deviation (MAD) | 19293 |
| Skewness | 0.0760918 |
| Sum | 12393688 |
| Variance | 4.5787083 × 10⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 54926 | 1 | 0.2% |
| 71451 | 1 | 0.2% |
| 42477 | 1 | 0.2% |
| 33567 | 1 | 0.2% |
| 48742 | 1 | 0.2% |
| 31352 | 1 | 0.2% |
| 62363 | 1 | 0.2% |
| 4314 | 1 | 0.2% |
| 27573 | 1 | 0.2% |
| 27188 | 1 | 0.2% |
| Other values (321) | 321 | 64.2% |
| (Missing) | 169 | 33.8% |
| Value | Count | Frequency (%) |
| 3376 | 1 | |
| 3470 | 1 | |
| 3597 | 1 | |
| 3699 | 1 | |
| 4205 | 1 | |
| 4314 | 1 | |
| 4754 | 1 | |
| 4766 | 1 | |
| 4847 | 1 | |
| 4853 | 1 |
| Value | Count | Frequency (%) |
| 74921 | 1 | |
| 74840 | 1 | |
| 74714 | 1 | |
| 74681 | 1 | |
| 74305 | 1 | |
| 74228 | 1 | |
| 73936 | 1 | |
| 73879 | 1 | |
| 73465 | 1 | |
| 73209 | 1 |
BASE_CCY_AMT
Real number (ℝ)
| Distinct | 241 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 259 |
| Missing (%) | 51.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18987.895 |
| Minimum | -288533.95 |
|---|---|
| Maximum | 271052.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 97 |
| Negative (%) | 19.4% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | -288533.95 |
|---|---|
| 5-th percentile | -130837.1 |
| Q1 | -37570.492 |
| median | 17343.536 |
| Q3 | 79620.254 |
| 95-th percentile | 173503.06 |
| Maximum | 271052.1 |
| Range | 559586.06 |
| Interquartile range (IQR) | 117190.75 |
Descriptive statistics
| Standard deviation | 93599.776 |
|---|---|
| Coefficient of variation (CV) | 4.9294445 |
| Kurtosis | 0.40667477 |
| Mean | 18987.895 |
| Median Absolute Deviation (MAD) | 58362.134 |
| Skewness | -0.18049233 |
| Sum | 4576082.7 |
| Variance | 8.760918 × 10⁹ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -44833.413 | 1 | 0.2% |
| 2930.122 | 1 | 0.2% |
| 31282.159 | 1 | 0.2% |
| 134385.318 | 1 | 0.2% |
| 118093.832 | 1 | 0.2% |
| -118718.714 | 1 | 0.2% |
| 105061.914 | 1 | 0.2% |
| -20758.632 | 1 | 0.2% |
| -55830.93 | 1 | 0.2% |
| 203307.055 | 1 | 0.2% |
| Other values (231) | 231 | 46.2% |
| (Missing) | 259 | 51.8% |
| Value | Count | Frequency (%) |
| -288533.954 | 1 | |
| -268590.032 | 1 | |
| -244230.521 | 1 | |
| -208253.208 | 1 | |
| -192804.416 | 1 | |
| -168063.322 | 1 | |
| -156592.471 | 1 | |
| -155096.867 | 1 | |
| -152737.846 | 1 | |
| -150431.197 | 1 |
| Value | Count | Frequency (%) |
| 271052.103 | 1 | |
| 240732.589 | 1 | |
| 221636.604 | 1 | |
| 219576.436 | 1 | |
| 210386.764 | 1 | |
| 205815.698 | 1 | |
| 203307.055 | 1 | |
| 195558.866 | 1 | |
| 191006.462 | 1 | |
| 180015.824 | 1 |
FNCT_CCY_AMT
Real number (ℝ)
| Distinct | 363 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 137 |
| Missing (%) | 27.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 460206.98 |
| Minimum | -2434033.8 |
|---|---|
| Maximum | 3160114.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 124 |
| Negative (%) | 24.8% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | -2434033.8 |
|---|---|
| 5-th percentile | -1138222.3 |
| Q1 | -239795.77 |
| median | 420493 |
| Q3 | 1150948.7 |
| 95-th percentile | 2180152.1 |
| Maximum | 3160114.6 |
| Range | 5594148.4 |
| Interquartile range (IQR) | 1390744.4 |
Descriptive statistics
| Standard deviation | 995168.71 |
|---|---|
| Coefficient of variation (CV) | 2.1624372 |
| Kurtosis | -0.23109228 |
| Mean | 460206.98 |
| Median Absolute Deviation (MAD) | 684301.04 |
| Skewness | 0.081152121 |
| Sum | 1.6705514 × 10⁸ |
| Variance | 9.9036076 × 10¹¹ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -15523.959 | 1 | 0.2% |
| 247430.22 | 1 | 0.2% |
| -1342161.295 | 1 | 0.2% |
| -562170.904 | 1 | 0.2% |
| -511066.453 | 1 | 0.2% |
| -239309.644 | 1 | 0.2% |
| 358194.845 | 1 | 0.2% |
| 158024.849 | 1 | 0.2% |
| 1332562.018 | 1 | 0.2% |
| 1042262.496 | 1 | 0.2% |
| Other values (353) | 353 | 70.6% |
| (Missing) | 137 | 27.4% |
| Value | Count | Frequency (%) |
| -2434033.762 | 1 | |
| -2140647.272 | 1 | |
| -1916794.405 | 1 | |
| -1726109.593 | 1 | |
| -1647295.471 | 1 | |
| -1600178.452 | 1 | |
| -1578609.913 | 1 | |
| -1475017.332 | 1 | |
| -1460906.872 | 1 | |
| -1443764.842 | 1 |
| Value | Count | Frequency (%) |
| 3160114.597 | 1 | |
| 2972165.912 | 1 | |
| 2836324.624 | 1 | |
| 2808438.736 | 1 | |
| 2711341.818 | 1 | |
| 2630543.318 | 1 | |
| 2558025.202 | 1 | |
| 2512439.158 | 1 | |
| 2438191.854 | 1 | |
| 2423255.847 | 1 |
ENTRPS_PROD_CD
Real number (ℝ)
| Distinct | 232 |
|---|---|
| Distinct (%) | 63.7% |
| Missing | 136 |
| Missing (%) | 27.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1307.9533 |
| Minimum | 1102 |
|---|---|
| Maximum | 1498 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.0 KiB |
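The "Distinct (%)" figure here is below 100% even though the column is numeric. The reported percentages are consistent with dividing the distinct count by the number of non-missing observations rather than by all 500 rows. A small sketch under that assumption (the helper `distinct_share` is hypothetical, written only for this check):

```python
def distinct_share(n_distinct: int, n_observations: int, n_missing: int) -> float:
    """Share of distinct values among non-missing observations (assumed definition)."""
    return n_distinct / (n_observations - n_missing)

# Figures copied from the ENTRPS_PROD_CD and CITI_LV overview tables.
entrps_prod_cd = distinct_share(232, 500, 136)
citi_lv = distinct_share(299, 500, 195)

print(f"{entrps_prod_cd:.1%}")  # 63.7%
print(f"{citi_lv:.1%}")         # 98.0%
```

Both values match the report, which suggests that repeated codes, not missing rows, are what pull the percentage below 100%.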
Quantile statistics
| Minimum | 1102 |
|---|---|
| 5-th percentile | 1127.15 |
| Q1 | 1212 |
| median | 1312 |
| Q3 | 1404 |
| 95-th percentile | 1483 |
| Maximum | 1498 |
| Range | 396 |
| Interquartile range (IQR) | 192 |
Descriptive statistics
| Standard deviation | 114.51029 |
|---|---|
| Coefficient of variation (CV) | 0.08754922 |
| Kurtosis | -1.1631046 |
| Mean | 1307.9533 |
| Median Absolute Deviation (MAD) | 95.5 |
| Skewness | -0.077027785 |
| Sum | 476095 |
| Variance | 13112.607 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1242 | 5 | 1.0% |
| 1312 | 4 | 0.8% |
| 1110 | 4 | 0.8% |
| 1380 | 4 | 0.8% |
| 1489 | 4 | 0.8% |
| 1493 | 4 | 0.8% |
| 1369 | 4 | 0.8% |
| 1362 | 4 | 0.8% |
| 1308 | 3 | 0.6% |
| 1370 | 3 | 0.6% |
| Other values (222) | 325 | 65.0% |
| (Missing) | 136 | 27.2% |
| Value | Count | Frequency (%) |
| 1102 | 1 | 0.2% |
| 1103 | 2 | 0.4% |
| 1105 | 1 | 0.2% |
| 1106 | 3 | 0.6% |
| 1108 | 1 | 0.2% |
| 1109 | 1 | 0.2% |
| 1110 | 4 | 0.8% |
| 1111 | 1 | 0.2% |
| 1113 | 1 | 0.2% |
| 1118 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 1498 | 1 | 0.2% |
| 1497 | 2 | 0.4% |
| 1496 | 1 | 0.2% |
| 1495 | 1 | 0.2% |
| 1493 | 4 | 0.8% |
| 1491 | 1 | 0.2% |
| 1490 | 2 | 0.4% |
| 1489 | 4 | 0.8% |
| 1488 | 1 | 0.2% |
| 1485 | 1 | 0.2% |
TXN_CCY_AMT
Real number (ℝ)
| Distinct | 450 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 50 |
| Missing (%) | 10.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 154316.31 |
| Minimum | -531204.64 |
|---|---|
| Maximum | 865135.14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 125 |
| Negative (%) | 25.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | -531204.64 |
|---|---|
| 5-th percentile | -229415.38 |
| Q1 | -21573.834 |
| median | 154039.74 |
| Q3 | 324141.06 |
| 95-th percentile | 550578.85 |
| Maximum | 865135.14 |
| Range | 1396339.8 |
| Interquartile range (IQR) | 345714.89 |
Descriptive statistics
| Standard deviation | 237797.34 |
|---|---|
| Coefficient of variation (CV) | 1.5409735 |
| Kurtosis | -0.3227069 |
| Mean | 154316.31 |
| Median Absolute Deviation (MAD) | 173874.61 |
| Skewness | 0.025537128 |
| Sum | 69442339 |
| Variance | 5.6547575 × 10¹⁰ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -34587.403 | 1 | 0.2% |
| 536743.691 | 1 | 0.2% |
| 403297.51 | 1 | 0.2% |
| 205937.747 | 1 | 0.2% |
| 173717.56 | 1 | 0.2% |
| 430468.839 | 1 | 0.2% |
| -73795.275 | 1 | 0.2% |
| 34673.308 | 1 | 0.2% |
| -34679.051 | 1 | 0.2% |
| 553729.9 | 1 | 0.2% |
| Other values (440) | 440 | 88.0% |
| (Missing) | 50 | 10.0% |
| Value | Count | Frequency (%) |
| -531204.639 | 1 | |
| -418197.725 | 1 | |
| -389183.76 | 1 | |
| -365002.678 | 1 | |
| -359197.623 | 1 | |
| -353324.042 | 1 | |
| -352708.853 | 1 | |
| -345272.1 | 1 | |
| -340836.678 | 1 | |
| -340386.08 | 1 |
| Value | Count | Frequency (%) |
| 865135.144 | 1 | |
| 759775.446 | 1 | |
| 737590.109 | 1 | |
| 727082.396 | 1 | |
| 675521.854 | 1 | |
| 658324.876 | 1 | |
| 655198.127 | 1 | |
| 653214.353 | 1 | |
| 647934.005 | 1 | |
| 633579.62 | 1 |
CITI_LV
Real number (ℝ)
| Distinct | 299 |
|---|---|
| Distinct (%) | 98.0% |
| Missing | 195 |
| Missing (%) | 39.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15129.662 |
| Minimum | 11011 |
|---|---|
| Maximum | 18995 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 11011 |
|---|---|
| 5-th percentile | 11380 |
| Q1 | 12997 |
| median | 15256 |
| Q3 | 17245 |
| 95-th percentile | 18739.2 |
| Maximum | 18995 |
| Range | 7984 |
| Interquartile range (IQR) | 4248 |
Descriptive statistics
| Standard deviation | 2385.4398 |
|---|---|
| Coefficient of variation (CV) | 0.15766643 |
| Kurtosis | -1.231152 |
| Mean | 15129.662 |
| Median Absolute Deviation (MAD) | 2086 |
| Skewness | -0.085699951 |
| Sum | 4614547 |
| Variance | 5690322.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13082 | 2 | 0.4% |
| 11662 | 2 | 0.4% |
| 18047 | 2 | 0.4% |
| 18893 | 2 | 0.4% |
| 17322 | 2 | 0.4% |
| 16229 | 2 | 0.4% |
| 18887 | 1 | 0.2% |
| 15948 | 1 | 0.2% |
| 16459 | 1 | 0.2% |
| 16522 | 1 | 0.2% |
| Other values (289) | 289 | 57.8% |
| (Missing) | 195 | 39.0% |
| Value | Count | Frequency (%) |
| 11011 | 1 | |
| 11028 | 1 | |
| 11044 | 1 | |
| 11063 | 1 | |
| 11065 | 1 | |
| 11104 | 1 | |
| 11125 | 1 | |
| 11155 | 1 | |
| 11166 | 1 | |
| 11188 | 1 |
| Value | Count | Frequency (%) |
| 18995 | 1 | |
| 18897 | 1 | |
| 18894 | 1 | |
| 18893 | 2 | |
| 18887 | 1 | |
| 18884 | 1 | |
| 18872 | 1 | |
| 18861 | 1 | |
| 18795 | 1 | |
| 18794 | 1 |
EFF_DT
Categorical
| Distinct | 482 |
|---|---|
| Distinct (%) | 96.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.0 KiB |
| 12/03/2018 | 2 |
|---|---|
| 03/04/2004 | 2 |
| 02/10/2017 | 2 |
| 10/09/2003 | 2 |
| 04/13/2003 | 2 |
| Other values (477) | 490 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 5000 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 464 |
|---|---|
| Unique (%) | 92.8% |
Sample
| 1st row | 01/14/2011 |
|---|---|
| 2nd row | 12/12/2007 |
| 3rd row | 10/15/2008 |
| 4th row | 09/12/1999 |
| 5th row | 08/05/2010 |
Common Values
| Value | Count | Frequency (%) |
| 12/03/2018 | 2 | 0.4% |
| 03/04/2004 | 2 | 0.4% |
| 02/10/2017 | 2 | 0.4% |
| 10/09/2003 | 2 | 0.4% |
| 04/13/2003 | 2 | 0.4% |
| 01/26/2021 | 2 | 0.4% |
| 07/17/2014 | 2 | 0.4% |
| 08/12/2016 | 2 | 0.4% |
| 09/14/2019 | 2 | 0.4% |
| 01/12/1997 | 2 | 0.4% |
| Other values (472) | 480 | 96.0% |
Length
| Value | Count | Frequency (%) |
| 12/03/2018 | 2 | 0.4% |
| 03/18/2017 | 2 | 0.4% |
| 03/04/2004 | 2 | 0.4% |
| 06/15/2015 | 2 | 0.4% |
| 04/27/2001 | 2 | 0.4% |
| 01/17/2010 | 2 | 0.4% |
| 02/06/2019 | 2 | 0.4% |
| 02/27/2013 | 2 | 0.4% |
| 07/26/2018 | 2 | 0.4% |
| 03/30/2008 | 2 | 0.4% |
| Other values (472) | 480 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1294 | |
| / | 1000 | |
| 2 | 831 | |
| 1 | 785 | |
| 9 | 267 | 5.3% |
| 7 | 157 | 3.1% |
| 8 | 156 | 3.1% |
| 3 | 152 | 3.0% |
| 6 | 132 | 2.6% |
| 5 | 124 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4000 | |
| Other Punctuation | 1000 | 20.0% |
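The category names in these tables ("Decimal Number", "Other Punctuation") correspond to Unicode general categories. As a rough illustration of how such counts can be obtained, the sketch below tallies characters and categories for the five sample rows listed above, using Python's standard `unicodedata` module. It is an assumption that the profiling tool works along similar lines; the code reports the short category codes `Nd` (Decimal Number) and `Po` (Other Punctuation) rather than the long names.

```python
from collections import Counter
import unicodedata

# The five EFF_DT sample rows from this report.
sample = ["01/14/2011", "12/12/2007", "10/15/2008", "09/12/1999", "08/05/2010"]
text = "".join(sample)

# Count individual characters and their Unicode general categories.
char_counts = Counter(text)
category_counts = Counter(unicodedata.category(ch) for ch in text)

print(char_counts.most_common(3))
print(dict(category_counts))  # {'Nd': 40, 'Po': 10}
```

Each date string has 10 characters, 8 digits and 2 slashes, which is why the full column of 500 values yields 5000 characters split into 4000 decimal digits and 1000 punctuation marks.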
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1294 | |
| 2 | 831 | |
| 1 | 785 | |
| 9 | 267 | 6.7% |
| 7 | 157 | 3.9% |
| 8 | 156 | 3.9% |
| 3 | 152 | 3.8% |
| 6 | 132 | 3.3% |
| 5 | 124 | 3.1% |
| 4 | 102 | 2.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1294 | |
| / | 1000 | |
| 2 | 831 | |
| 1 | 785 | |
| 9 | 267 | 5.3% |
| 7 | 157 | 3.1% |
| 8 | 156 | 3.1% |
| 3 | 152 | 3.0% |
| 6 | 132 | 2.6% |
| 5 | 124 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1294 | |
| / | 1000 | |
| 2 | 831 | |
| 1 | 785 | |
| 9 | 267 | 5.3% |
| 7 | 157 | 3.1% |
| 8 | 156 | 3.1% |
| 3 | 152 | 3.0% |
| 6 | 132 | 2.6% |
| 5 | 124 | 2.5% |
BKG_DT
Categorical
| Distinct | 485 |
|---|---|
| Distinct (%) | 97.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.0 KiB |
| 03/10/2011 | 2 |
|---|---|
| 05/27/2005 | 2 |
| 01/22/1999 | 2 |
| 01/08/2003 | 2 |
| 05/25/2010 | 2 |
| Other values (480) | 490 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 5000 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 470 |
|---|---|
| Unique (%) | 94.0% |
Sample
| 1st row | 03/10/2011 |
|---|---|
| 2nd row | 11/12/2007 |
| 3rd row | 10/05/2008 |
| 4th row | 09/09/1999 |
| 5th row | 08/11/2010 |
Common Values
| Value | Count | Frequency (%) |
| 03/10/2011 | 2 | 0.4% |
| 05/27/2005 | 2 | 0.4% |
| 01/22/1999 | 2 | 0.4% |
| 01/08/2003 | 2 | 0.4% |
| 05/25/2010 | 2 | 0.4% |
| 03/23/2015 | 2 | 0.4% |
| 03/04/2017 | 2 | 0.4% |
| 12/05/2014 | 2 | 0.4% |
| 09/18/2008 | 2 | 0.4% |
| 01/14/2011 | 2 | 0.4% |
| Other values (475) | 480 | 96.0% |
Length
| Value | Count | Frequency (%) |
| 03/10/2011 | 2 | 0.4% |
| 09/18/2008 | 2 | 0.4% |
| 05/27/2005 | 2 | 0.4% |
| 09/14/2007 | 2 | 0.4% |
| 08/01/2019 | 2 | 0.4% |
| 01/12/2017 | 2 | 0.4% |
| 12/18/1997 | 2 | 0.4% |
| 01/14/2011 | 2 | 0.4% |
| 08/14/2020 | 2 | 0.4% |
| 12/05/2014 | 2 | 0.4% |
| Other values (475) | 480 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1300 | |
| / | 1000 | |
| 2 | 844 | |
| 1 | 772 | |
| 9 | 260 | 5.2% |
| 8 | 152 | 3.0% |
| 3 | 149 | 3.0% |
| 7 | 149 | 3.0% |
| 5 | 135 | 2.7% |
| 4 | 122 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4000 | |
| Other Punctuation | 1000 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1300 | |
| 2 | 844 | |
| 1 | 772 | |
| 9 | 260 | 6.5% |
| 8 | 152 | 3.8% |
| 3 | 149 | 3.7% |
| 7 | 149 | 3.7% |
| 5 | 135 | 3.4% |
| 4 | 122 | 3.0% |
| 6 | 117 | 2.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1300 | |
| / | 1000 | |
| 2 | 844 | |
| 1 | 772 | |
| 9 | 260 | 5.2% |
| 8 | 152 | 3.0% |
| 3 | 149 | 3.0% |
| 7 | 149 | 3.0% |
| 5 | 135 | 2.7% |
| 4 | 122 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1300 | |
| / | 1000 | |
| 2 | 844 | |
| 1 | 772 | |
| 9 | 260 | 5.2% |
| 8 | 152 | 3.0% |
| 3 | 149 | 3.0% |
| 7 | 149 | 3.0% |
| 5 | 135 | 2.7% |
| 4 | 122 | 2.4% |
ib_flag
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 149 |
| Missing (%) | 29.8% |
| Memory size | 1.1 KiB |
| False | 176 |
|---|---|
| True | 175 |
| (Missing) | 149 |
| Value | Count | Frequency (%) |
| False | 176 | 35.2% |
| True | 175 | 35.0% |
| (Missing) | 149 | 29.8% |
segr_flag
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 138 |
| Missing (%) | 27.6% |
| Memory size | 4.0 KiB |
| Y | 190 |
|---|---|
| N | 169 |
| (two spaces) | 3 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.0082873 |
| Min length | 1 |
Characters and Unicode
| Total characters | 365 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Y |
|---|---|
| 2nd row | |
| 3rd row | N |
| 4th row | N |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| Y | 190 | 38.0% |
| N | 169 | 33.8% |
| (two spaces) | 3 | 0.6% |
| (Missing) | 138 | 27.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| y | 190 | |
| n | 169 |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 190 | 52.1% |
| N | 169 | 46.3% |
| (space) | 6 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 359 | |
| Space Separator | 6 | 1.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 190 | |
| N | 169 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 359 | |
| Common | 6 | 1.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 190 | |
| N | 169 |
Common
| Value | Count | Frequency (%) |
| (space) | 6 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 365 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 190 | 52.1% |
| N | 169 | 46.3% |
| (space) | 6 | 1.6% |
FRS_ACCOUNT_CLASS
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 163 |
| Missing (%) | 32.6% |
| Memory size | 4.0 KiB |
| CASH | 172 |
|---|---|
| OVDFT-LIAB | 165 |
Length
| Max length | 10 |
|---|---|
| Median length | 4 |
| Mean length | 6.9376855 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2338 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | OVDFT-LIAB |
|---|---|
| 2nd row | CASH |
| 3rd row | OVDFT-LIAB |
| 4th row | CASH |
| 5th row | OVDFT-LIAB |
Common Values
| Value | Count | Frequency (%) |
| CASH | 172 | 34.4% |
| OVDFT-LIAB | 165 | 33.0% |
| (Missing) | 163 | 32.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cash | 172 | |
| ovdft-liab | 165 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 337 | |
| C | 172 | 7.4% |
| S | 172 | 7.4% |
| H | 172 | 7.4% |
| O | 165 | 7.1% |
| V | 165 | 7.1% |
| D | 165 | 7.1% |
| F | 165 | 7.1% |
| T | 165 | 7.1% |
| - | 165 | 7.1% |
| Other values (3) | 495 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2173 | |
| Dash Punctuation | 165 | 7.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 337 | |
| C | 172 | |
| S | 172 | |
| H | 172 | |
| O | 165 | |
| V | 165 | |
| D | 165 | |
| F | 165 | |
| T | 165 | |
| L | 165 | |
| Other values (2) | 330 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 165 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2173 | |
| Common | 165 | 7.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 337 | |
| C | 172 | |
| S | 172 | |
| H | 172 | |
| O | 165 | |
| V | 165 | |
| D | 165 | |
| F | 165 | |
| T | 165 | |
| L | 165 | |
| Other values (2) | 330 |
Common
| Value | Count | Frequency (%) |
| - | 165 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2338 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 337 | |
| C | 172 | 7.4% |
| S | 172 | 7.4% |
| H | 172 | 7.4% |
| O | 165 | 7.1% |
| V | 165 | 7.1% |
| D | 165 | 7.1% |
| F | 165 | 7.1% |
| T | 165 | 7.1% |
| - | 165 | 7.1% |
| Other values (3) | 495 |
GAAP_TYP_CD
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 132 |
| Missing (%) | 26.4% |
| Memory size | 4.0 KiB |
| US_GAAP | 149 |
|---|---|
| LCL_GAAP | 78 |
| COM_GAAP | 75 |
| I_GAAP | 66 |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.236413 |
| Min length | 6 |
Characters and Unicode
| Total characters | 2663 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US_GAAP |
|---|---|
| 2nd row | US_GAAP |
| 3rd row | US_GAAP |
| 4th row | US_GAAP |
| 5th row | COM_GAAP |
Common Values
| Value | Count | Frequency (%) |
| US_GAAP | 149 | 29.8% |
| LCL_GAAP | 78 | 15.6% |
| COM_GAAP | 75 | 15.0% |
| I_GAAP | 66 | 13.2% |
| (Missing) | 132 | 26.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| us_gaap | 149 | |
| lcl_gaap | 78 | |
| com_gaap | 75 | |
| i_gaap | 66 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 736 | |
| _ | 368 | |
| G | 368 | |
| P | 368 | |
| L | 156 | 5.9% |
| C | 153 | 5.7% |
| U | 149 | 5.6% |
| S | 149 | 5.6% |
| O | 75 | 2.8% |
| M | 75 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2295 | |
| Connector Punctuation | 368 | 13.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 736 | |
| G | 368 | |
| P | 368 | |
| L | 156 | 6.8% |
| C | 153 | 6.7% |
| U | 149 | 6.5% |
| S | 149 | 6.5% |
| O | 75 | 3.3% |
| M | 75 | 3.3% |
| I | 66 | 2.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 368 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2295 | |
| Common | 368 | 13.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 736 | |
| G | 368 | |
| P | 368 | |
| L | 156 | 6.8% |
| C | 153 | 6.7% |
| U | 149 | 6.5% |
| S | 149 | 6.5% |
| O | 75 | 3.3% |
| M | 75 | 3.3% |
| I | 66 | 2.9% |
Common
| Value | Count | Frequency (%) |
| _ | 368 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2663 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 736 | |
| _ | 368 | |
| G | 368 | |
| P | 368 | |
| L | 156 | 5.9% |
| C | 153 | 5.7% |
| U | 149 | 5.6% |
| S | 149 | 5.6% |
| O | 75 | 2.8% |
| M | 75 | 2.8% |
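The character statistics for this variable follow directly from the value counts in the Common Values table. As a minimal sketch (the names `value_counts` and `char_counts` are illustrative and not part of the report), the totals can be re-derived by weighting each character of a value by the number of rows holding that value:

```python
from collections import Counter

# Non-missing value counts, copied from the Common Values table above.
value_counts = {"US_GAAP": 149, "LCL_GAAP": 78, "COM_GAAP": 75, "I_GAAP": 66}

# Each character of a value occurs once per row that holds the value.
char_counts = Counter()
for value, n_rows in value_counts.items():
    for ch in value:
        char_counts[ch] += n_rows

total_chars = sum(char_counts.values())   # "Total characters"
distinct_chars = len(char_counts)         # "Distinct characters"
underscore_share = round(100 * char_counts["_"] / total_chars, 1)

print(total_chars, distinct_chars, char_counts["A"], underscore_share)
```

Under these counts the sketch gives 2663 total characters, 11 distinct characters, 736 occurrences of `A`, and a 13.8% share for `_`, in line with the tables above.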
FNCT_CCY
Categorical
| Distinct | 144 |
|---|---|
| Distinct (%) | 40.4% |
| Missing | 144 |
| Missing (%) | 28.8% |
| Memory size | 4.0 KiB |
| ZAR | 6 |
|---|---|
| IMP | 6 |
| SAR | 6 |
| ZWD | 6 |
| TZS | 5 |
| Other values (139) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1068 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 36 |
|---|---|
| Unique (%) | 10.1% |
Sample
| 1st row | GHS |
|---|---|
| 2nd row | BND |
| 3rd row | BND |
| 4th row | BBD |
| 5th row | MZN |
Common Values
| Value | Count | Frequency (%) |
| ZAR | 6 | 1.2% |
| IMP | 6 | 1.2% |
| SAR | 6 | 1.2% |
| ZWD | 6 | 1.2% |
| TZS | 5 | 1.0% |
| SZL | 5 | 1.0% |
| YER | 5 | 1.0% |
| BAM | 5 | 1.0% |
| COP | 5 | 1.0% |
| BBD | 5 | 1.0% |
| Other values (134) | 302 | |
| (Missing) | 144 |
Common Values (Plot)
| Value | Count | Frequency (%) |
| zar | 6 | 1.7% |
| sar | 6 | 1.7% |
| zwd | 6 | 1.7% |
| imp | 6 | 1.7% |
| bam | 5 | 1.4% |
| bsd | 5 | 1.4% |
| cop | 5 | 1.4% |
| bbd | 5 | 1.4% |
| yer | 5 | 1.4% |
| szl | 5 | 1.4% |
| Other values (134) | 302 |
Most occurring characters
| Value | Count | Frequency (%) |
| D | 96 | 9.0% |
| S | 76 | 7.1% |
| R | 75 | 7.0% |
| P | 71 | 6.6% |
| K | 66 | 6.2% |
| B | 62 | 5.8% |
| L | 59 | 5.5% |
| N | 53 | 5.0% |
| A | 53 | 5.0% |
| M | 51 | 4.8% |
| Other values (15) | 406 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1068 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 96 | 9.0% |
| S | 76 | 7.1% |
| R | 75 | 7.0% |
| P | 71 | 6.6% |
| K | 66 | 6.2% |
| B | 62 | 5.8% |
| L | 59 | 5.5% |
| N | 53 | 5.0% |
| A | 53 | 5.0% |
| M | 51 | 4.8% |
| Other values (15) | 406 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1068 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| D | 96 | 9.0% |
| S | 76 | 7.1% |
| R | 75 | 7.0% |
| P | 71 | 6.6% |
| K | 66 | 6.2% |
| B | 62 | 5.8% |
| L | 59 | 5.5% |
| N | 53 | 5.0% |
| A | 53 | 5.0% |
| M | 51 | 4.8% |
| Other values (15) | 406 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1068 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| D | 96 | 9.0% |
| S | 76 | 7.1% |
| R | 75 | 7.0% |
| P | 71 | 6.6% |
| K | 66 | 6.2% |
| B | 62 | 5.8% |
| L | 59 | 5.5% |
| N | 53 | 5.0% |
| A | 53 | 5.0% |
| M | 51 | 4.8% |
| Other values (15) | 406 |
TXN_CCY
Categorical
| Distinct | 143 |
|---|---|
| Distinct (%) | 40.6% |
| Missing | 148 |
| Missing (%) | 29.6% |
| Memory size | 4.0 KiB |
| LTL | 7 |
|---|---|
| MAD | 6 |
| ALL | 6 |
| JMD | 6 |
| PHP | 6 |
| Other values (138) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1056 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 35 |
|---|---|
| Unique (%) | 9.9% |
Sample
| 1st row | SVC |
|---|---|
| 2nd row | BRL |
| 3rd row | TVD |
| 4th row | BDT |
| 5th row | JMD |
Common Values
| Value | Count | Frequency (%) |
| LTL | 7 | 1.4% |
| MAD | 6 | 1.2% |
| ALL | 6 | 1.2% |
| JMD | 6 | 1.2% |
| PHP | 6 | 1.2% |
| FJD | 5 | 1.0% |
| EUR | 5 | 1.0% |
| SOS | 5 | 1.0% |
| BTN | 5 | 1.0% |
| TMT | 5 | 1.0% |
| Other values (133) | 296 | |
| (Missing) | 148 |
Common Values (Plot)
| Value | Count | Frequency (%) |
| ltl | 7 | 2.0% |
| all | 6 | 1.7% |
| jmd | 6 | 1.7% |
| php | 6 | 1.7% |
| mad | 6 | 1.7% |
| tmt | 5 | 1.4% |
| mdl | 5 | 1.4% |
| btn | 5 | 1.4% |
| sos | 5 | 1.4% |
| eur | 5 | 1.4% |
| Other values (133) | 296 |
Most occurring characters
| Value | Count | Frequency (%) |
| D | 117 | 11.1% |
| S | 75 | 7.1% |
| R | 68 | 6.4% |
| L | 67 | 6.3% |
| M | 56 | 5.3% |
| N | 56 | 5.3% |
| P | 54 | 5.1% |
| T | 52 | 4.9% |
| G | 52 | 4.9% |
| A | 51 | 4.8% |
| Other values (16) | 408 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1056 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 117 | 11.1% |
| S | 75 | 7.1% |
| R | 68 | 6.4% |
| L | 67 | 6.3% |
| M | 56 | 5.3% |
| N | 56 | 5.3% |
| P | 54 | 5.1% |
| T | 52 | 4.9% |
| G | 52 | 4.9% |
| A | 51 | 4.8% |
| Other values (16) | 408 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1056 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| D | 117 | 11.1% |
| S | 75 | 7.1% |
| R | 68 | 6.4% |
| L | 67 | 6.3% |
| M | 56 | 5.3% |
| N | 56 | 5.3% |
| P | 54 | 5.1% |
| T | 52 | 4.9% |
| G | 52 | 4.9% |
| A | 51 | 4.8% |
| Other values (16) | 408 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1056 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| D | 117 | 11.1% |
| S | 75 | 7.1% |
| R | 68 | 6.4% |
| L | 67 | 6.3% |
| M | 56 | 5.3% |
| N | 56 | 5.3% |
| P | 54 | 5.1% |
| T | 52 | 4.9% |
| G | 52 | 4.9% |
| A | 51 | 4.8% |
| Other values (16) | 408 |
Auto
The auto setting is an interpretable pairwise column metric based on the following mapping:
- Variable_type-Variable_type : Method, Range
- Categorical-Categorical : Cramer's V, [0,1]
- Numerical-Categorical : Cramer's V, [0,1] (using a discretized numerical column)
- Numerical-Numerical : Spearman's ρ, [-1,1]
This configuration uses the recommended metric for each pair of columns.
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.
To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
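The calculation described above can be sketched in plain Python. This is an illustrative re-derivation, not the code the report itself runs; the helper names `average_ranks` and `spearman_rho` are made up for the example, and tied values are assumed to receive the average of their rank positions.

```python
from statistics import fmean, pstdev

def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Covariance of the ranks divided by the product of their std devs."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = fmean(rx), fmean(ry)
    cov = fmean([(a - mx) * (b - my) for a, b in zip(rx, ry)])
    return cov / (pstdev(rx) * pstdev(ry))

x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]                        # nonlinear but monotonic
rho_up = spearman_rho(x, y)                    # close to +1
rho_down = spearman_rho(x, [-v for v in y])    # close to -1
```

Because only ranks enter the formula, the cubic relationship scores the same as a straight line would.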
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.
To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
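As a rough illustration of that formula and of the invariance property, the sketch below computes r on a small made-up sample and again after shifting and rescaling both variables with positive factors. The function name `pearson_r` and the data are assumptions for the example only.

```python
from statistics import fmean, pstdev

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their std devs."""
    mx, my = fmean(x), fmean(y)
    cov = fmean([(a - mx) * (b - my) for a, b in zip(x, y)])
    return cov / (pstdev(x) * pstdev(y))

# Made-up, roughly linear sample.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.1, 5.9, 8.2, 9.8]

r = pearson_r(x, y)

# Separate changes in location and (positive) scale leave r unchanged.
x_scaled = [10 * a + 3 for a in x]
y_scaled = [0.5 * b - 7 for b in y]
r_scaled = pearson_r(x_scaled, y_scaled)
```

Both calls return the same value up to floating-point rounding, which is the invariance the paragraph refers to.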
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.
To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
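The pair-counting definition above corresponds to what is usually called the tau-a variant. A minimal sketch follows, with an illustrative function name and toy data. Note that statistical libraries often default to a tie-adjusted variant instead (SciPy's `kendalltau` defaults to tau-b), so results can differ when the data contain ties.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, no tie correction."""
    pairs = list(combinations(range(len(x)), 2))
    concordant = discordant = 0
    for i, j in pairs:
        # Same sign of the two differences means the pair is ordered alike.
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

x = [1, 2, 3, 4]
y = [1, 3, 2, 4]     # one swapped pair out of six
tau = kendall_tau_a(x, y)
```

Here five of the six pairs are concordant and one is discordant, so τ = (5 - 1) / 6.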
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proven to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| | SRC_SYS_ID | FRS_BU | FRS_AFFL_CD | ACTG_UNIT_ID | GOC | MNGD_SEG | BASE_CCY_AMT | FNCT_CCY_AMT | ENTRPS_PROD_CD | TXN_CCY_AMT | CITI_LV | EFF_DT | BKG_DT | ib_flag | segr_flag | FRS_ACCOUNT_CLASS | GAAP_TYP_CD | FNCT_CCY | TXN_CCY |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 97268.0 | 37123.0 | NaN | NaN | 12486250.0 | 54926.0 | NaN | NaN | 1448.0 | -34587.403 | NaN | 01/14/2011 | 03/10/2011 | N | Y | OVDFT-LIAB | US_GAAP | GHS | SVC |
| 1 | NaN | 34759.0 | NaN | NaN | NaN | 61614.0 | NaN | NaN | 1236.0 | 159254.204 | 11927.0 | 12/12/2007 | 11/12/2007 | Y | CASH | NaN | BND | BRL | |
| 2 | NaN | 32522.0 | NaN | 944.0 | NaN | 43359.0 | NaN | NaN | NaN | 336960.326 | NaN | 10/15/2008 | 10/05/2008 | Y | N | OVDFT-LIAB | US_GAAP | BND | NaN |
| 3 | 142220.0 | NaN | NaN | 186.0 | 12484279.0 | NaN | NaN | -1175084.544 | 1408.0 | 120884.520 | NaN | 09/12/1999 | 09/09/1999 | Y | N | NaN | US_GAAP | BBD | NaN |
| 4 | 34162.0 | 38463.0 | 11483.0 | 1958.0 | NaN | 72428.0 | -138661.589 | NaN | 1350.0 | -127826.876 | NaN | 08/05/2010 | 08/11/2010 | N | CASH | US_GAAP | MZN | TVD | |
| 5 | 47786.0 | NaN | NaN | 3752.0 | 12482521.0 | 34306.0 | NaN | -1016383.664 | 1172.0 | 290891.549 | 12450.0 | 11/05/2020 | 12/15/2020 | N | Y | OVDFT-LIAB | COM_GAAP | NaN | BDT |
| 6 | NaN | 44086.0 | 14003.0 | 4864.0 | NaN | 23853.0 | NaN | NaN | NaN | 398256.869 | NaN | 01/23/2017 | 01/12/2017 | NaN | N | NaN | COM_GAAP | SYP | JMD |
| 7 | NaN | 42688.0 | 11792.0 | 41.0 | NaN | 34018.0 | NaN | 1224781.103 | 1333.0 | 238877.829 | 12792.0 | 02/06/2019 | 02/18/2019 | N | N | CASH | COM_GAAP | JMD | KHR |
| 8 | 73774.0 | NaN | 14275.0 | 3182.0 | NaN | NaN | -29812.952 | -856057.738 | NaN | 366843.369 | NaN | 01/28/2002 | 03/09/2002 | NaN | NaN | NaN | LBP | PHP | |
| 9 | 58469.0 | 29995.0 | NaN | NaN | NaN | 53440.0 | -87900.843 | NaN | 1214.0 | -29383.891 | NaN | 06/18/2019 | 08/01/2019 | Y | NaN | NaN | I_GAAP | ILS | UZS |
Last rows
| | SRC_SYS_ID | FRS_BU | FRS_AFFL_CD | ACTG_UNIT_ID | GOC | MNGD_SEG | BASE_CCY_AMT | FNCT_CCY_AMT | ENTRPS_PROD_CD | TXN_CCY_AMT | CITI_LV | EFF_DT | BKG_DT | ib_flag | segr_flag | FRS_ACCOUNT_CLASS | GAAP_TYP_CD | FNCT_CCY | TXN_CCY |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 490 | NaN | 15043.0 | 14353.0 | 619.0 | NaN | 5941.0 | NaN | 489960.565 | 1395.0 | 278881.667 | 16165.0 | 01/18/2013 | 01/12/2013 | NaN | Y | NaN | LCL_GAAP | UAH | BOB |
| 491 | NaN | 19916.0 | NaN | 2287.0 | 586922586.0 | 28092.0 | NaN | NaN | NaN | 445452.056 | 11964.0 | 08/16/1999 | 09/29/1999 | N | Y | CASH | I_GAAP | NPR | NaN |
| 492 | NaN | 12070.0 | NaN | NaN | NaN | 23375.0 | NaN | NaN | NaN | -352708.853 | 15604.0 | 08/15/2020 | 10/02/2020 | N | Y | CASH | COM_GAAP | NaN | NaN |
| 493 | 36308.0 | 27451.0 | 14895.0 | 4270.0 | 586921092.0 | 25708.0 | -26955.582 | -449970.456 | 1370.0 | 27380.713 | 12376.0 | 05/30/2007 | 07/24/2007 | Y | N | CASH | NaN | NaN | KMF |
| 494 | NaN | 17191.0 | 14761.0 | NaN | NaN | 13227.0 | NaN | NaN | 1190.0 | NaN | 14659.0 | 04/12/2005 | 05/27/2005 | N | N | OVDFT-LIAB | NaN | NaN | NaN |
| 495 | 61654.0 | 19601.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 31436.407 | NaN | 10/04/2011 | 11/03/2011 | NaN | N | CASH | COM_GAAP | CLP | XPF |
| 496 | 114495.0 | NaN | NaN | NaN | NaN | NaN | NaN | 133706.304 | NaN | -232272.959 | 11396.0 | 08/18/2022 | 09/20/2022 | Y | Y | OVDFT-LIAB | NaN | MGA | OMR |
| 497 | 137934.0 | 39219.0 | 14734.0 | 2723.0 | 586921420.0 | 25509.0 | 106852.742 | -372890.565 | 1135.0 | 409500.148 | 13527.0 | 06/30/1998 | 06/23/1998 | Y | Y | CASH | COM_GAAP | FJD | RSD |
| 498 | 79237.0 | 48723.0 | 12709.0 | 4681.0 | 586921822.0 | 29707.0 | -208253.208 | -689971.564 | 1436.0 | 508128.698 | 18261.0 | 10/08/2014 | 12/05/2014 | NaN | NaN | CASH | I_GAAP | KES | NaN |
| 499 | 90611.0 | 48524.0 | 11512.0 | 1683.0 | 586922913.0 | 14890.0 | 16183.745 | 105384.818 | 1434.0 | -9386.869 | 12682.0 | 07/08/1999 | 07/16/1999 | Y | NaN | NaN | LCL_GAAP | ZMW | PEN |